Tractable Queries with Inequalities on Probabilistic Databases
نویسندگان
چکیده
The problem of efficient evaluation of queries on probabilistic databases has received a great attention in recent years. In this thesis, we introduce novel classes of queries with inequalities (<,≤) that admit tractable evaluation, in the context where query evaluation is #P-hard in general. We also introduce a new kind of hard query that is not captured by previous characterizations. As an application outside database theory, we show that particular tractable instances of our problem capture previously-open problems of counting vertex covers in bipartite chain and convex graphs, for which we can now easily derive efficient algorithms. Our approach to both query evaluation and counting vertex covers is based on a novel syntactical characterization of classes of k-DNFs that capture the lineage of queries from our tractable classes and that can be compiled in at most quadratic time into binary decision diagrams.
منابع مشابه
On Efficiently Evaluating Inequality Queries on Probabilistic Databases and Counting Vertex Covers
The problem of efficient evaluation of queries on probabilistic databases has received a great attention in recent years. In this paper, we introduce a class of queries with inequalities (<,≤) and self-joins that admit tractable evaluation, in the context where query evaluation is #P-hard in general. This class strictly contains the class of inequality queries recently introduced by Olteanu and...
متن کاملUsing OBDDs for Efficient Query Evaluation on Probabilistic Databases
We consider the problem of query evaluation for tuple independent probabilistic databases and Boolean conjunctive queries with inequalities but without self-joins. We approach this problem as a construction problem for ordered binary decision diagrams (OBDDs): Given a query q and a probabilistic database D, we construct in polynomial time an OBDD such that the probability of q(D) can be compute...
متن کاملEfficient Evaluation of HAVING Queries on a Probabilistic Database
We study the evaluation of positive conjunctive queries with Boolean aggregate tests (similar to HAVING queries in SQL) on probabilistic databases. Our motivation is to handle aggregate queries over imprecise data resulting from information integration or information extraction. More precisely, we study conjunctive queries with predicate aggregates using MIN, MAX, COUNT, SUM, AVG or COUNT(DISTI...
متن کاملRead-Once Functions and Query Evaluation in Probabilistic Databases
Probabilistic databases hold promise of being a viable means for large-scale uncertainty management, increasingly needed in a number of real world applications domains. However, query evaluation in probabilistic databases remains a computational challenge. Prior work on efficient exact query evaluation in probabilistic databases has largely concentrated on query-centric formulations (e.g., safe...
متن کاملTractable Query Answering Under Probabilistic Constraints
Large knowledge bases such as YAGO [SKW07] or DBpedia [BLK+09] can be used to answer queries in various domains. However, as they are automatically harvested from Web sources, they may be incomplete: important facts may be missing because they were not materialized in the original sources, or could not be extracted correctly. To mitigate this problem, approaches such as association rule mining ...
متن کامل